Master's Programme in Computational Linguistics
Course: Introduction to Computational Linguistics
Lecture 6
Theories of discourse: Centering
Lectures: Dan Cristea
Seminars & project: Mihaela Onofrei, Dan Cristea

The discourse layer
[Figure: processing pipeline from the initial text to the result, with sub-syntactic, syntactic, semantic, discourse and pragmatic processing stages]

The semantic layer
[Figure: the same pipeline, annotated with the topics: discourse structure, cohesion & coherence, summarisation, temporal processing, question answering]

What is discourse?
Longman:
1 a serious speech or piece of writing on a particular subject: Professor Grant delivered a long discourse on aspects of moral theology
2 serious conversation between people: You can't expect meaningful discourse when you two disagree so violently
3 the language used in particular kinds of speech or writing: scientific discourse

As you can see, definition 1 leaves dialogue aside. Dialogue is usually considered closely related to discourse, and many techniques devised for discourse can be applied to dialogue as well. In this presentation we will not refer to dialogue.

Text versus discourse
Syntactically – a discourse is more than a single sentence
[Example text from Garcia Marquez]

Text versus discourse
A text is not a discourse!
But it becomes a discourse the very moment it is read or heard by a human or a machine.

Time and discourse
Discourse has a dynamic nature.
[Figure: three time axes: real time, discourse interpretation time and story time. Two events, numbered 1 and 2, appear in the order 1, 2 on the first two axes and in the order 2, 1 on the story-time axis.]

Cohesion and coherence
A text manifests cohesion when its parts closely correlate.
A text is coherent when it makes sense with respect to an accepted setting, real or virtual.
Setting: a dynamic system of conventions.

Cohesion and coherence
• Cohesion: usually enforced by anaphoric links, repetitions, etc. (see Halliday and Hasan, 1978)
• Coherence: it is rather easy to decide that a text is coherent, and very difficult to risk claiming the contrary

Recently, a friend of mine challenged me, claiming that I could not give him a senseless sentence. So I uttered the famous Chomskyan sentence "Colorless green ideas sleep furiously", challenging him to find a sense in it. And he did: he explained to me that this sentence simply says that one night some ideas (colorless, as all ideas are) came, during an agitated sleep, to the mind of a politician, a member of the green party…

So the example argues for the necessity of a setting (or a context) with respect to which a discourse is given a meaning. Often the key to interpreting a discourse comes from finding this setting. This is why, to some people, a novel like The Sound and the Fury by William Faulkner is obscure and difficult to read, while for others it is such a relish.

Interpretation of discourse
[Figure: text + setting → discourse interpretation. The setting combines knowledge about the language, knowledge about the situation, knowledge about the author and knowledge about the world (real + virtual); intertextuality is also shown.]
The setting explains why one sentence can have different meanings for different people: each one decodes it in a different setting (context).

What do we expect from a theory of discourse?
• To tell us how the discourse is structured
• To make explicit, using this structure, some discourse phenomena: at least cohesion and coherence
• To explain interruptions and flashbacks
• To explain how the structure can be built from the raw text
• To serve readily as a basis for implementations that deal with discourse content

Centering - a theory of local discourse coherence
• Joshi, A. K. and Weinstein, S., 1981: "Control of Inference: Role of Some Aspects of Discourse Structure - Centering"
• Grosz, B.; Joshi, A. K. and Weinstein, S., 1986: "Towards a computational theory of discourse interpretation"
• Brennan, S. E.; Friedman, M. W. and Pollard, C. J., 1987: "A Centering approach to pronouns"
• Grosz, B.; Joshi, A. K. and Weinstein, S., 1995: "Centering: A framework for modeling the local coherence of discourse"
• Strube, M. and Hahn, U., 1996: "Functional Centering"
• Walker, M. A.; Joshi, A. K. and Prince, E. F. (eds.), 1997: "Centering in Discourse"
• Kameyama, M., 1997: "Intrasentential Centering: A Case Study"
• Poesio, M.; Stevenson, R.; Di Eugenio, B. and Hitzeman, J., 2004: "Centering: A Parametric Theory and Its Instantiations"

CT: goals of the theory
• explains why certain texts are more difficult to process than others
• explains why we use pronouns the way we do
• anchors a practical approach to anaphora resolution

A smooth discourse
from (Walker, Joshi and Prince, 1997)
a. Jeff_1 helped Dick_2 wash the car.
b. He_1 washed the windows as Dick_2 waxed the car.
c. He_1 soaped a pane.
He in (c) is Jeff, because soaping can only be related to the washing event.
The discourse starts by talking about Jeff in (a) and continues talking about him in (b); in (c) the semantic constraint dictates that it is still the same Jeff. Processing runs smoothly.

A less smooth discourse
a. Jeff_1 helped Dick_2 wash the car.
b. He_1 washed the windows as Dick_2 waxed the car.
c. He_2 buffed the hood.
He in (c) is Dick, because buffing can only be related to the waxing event.
CT claims that this discourse is harder to process than the previous one. A semantic inference dictates that in (c) He is linked to Dick and not to Jeff. Here attention shifts from Jeff, who had been the center of attention in the first two sentences, to Dick, who is talked about in (c). A shift of the center of attention is harder to process than keeping it.

CT: focus helps to disambiguate pronominal anaphora
from (Grosz, Joshi and Weinstein, 1995)
1. Susan_1 is a fine friend.
2. She_1 gives people the most wonderful presents.
3. She_1 just gave Betsy_2 a wonderful bottle of wine.
4. She_1 told her_2 it was quite rare.
5. She_1 knows a lot about wine.
There is no problem finding who she is in 2 and 3, because no referent other than Susan has been introduced by the discourse. Utterance 3 introduces Betsy, also a female character. Neither syntactic nor semantic criteria can be applied in 4 to decide who the referents of she and her are. Still, we easily link she to Susan and her to Betsy, because the focus of the preceding utterance was Susan and it is natural to assume that the focus is preserved. A similar criterion works in 5: more people are inclined to take she here to be Susan than Betsy.

CT explains the normality or oddness of some utterances
from (Grosz, Joshi and Weinstein, 1995)
a. Terry really goofs sometimes.
b. Yesterday was a beautiful day and he was excited about trying out his new sailboat.
c. He wanted Tony to join him on a sailing expedition.
d. He called him at 6 A.M.
CT investigates the interactions that can be established between the choice of referring expressions, the attentional state, the inferences required to determine the interpretation of an utterance in a discourse segment, and coherence. Pronouns and definite descriptions are not equivalent with respect to their effect on coherence. (paraphrased from GJW95)

This text has a defect…
a–d. (as above)
e. He was sick and furious at being woken up so early.
Up to unit (d) the focus has been kept constant on Terry. Only when "sick" appears does the reader realise that he in (e) is no longer Terry but Tony, because Terry could not have been sick. For a moment the reader was misled, and had to make an inference to repair the confusion.

This text repairs the preceding one
a–d. (as above)
e. Tony was sick and furious at being woken up so early.
A more natural sequence would have used Tony directly in (e).

And we can go on like this:
a–d. (as above)
e. Tony was sick and furious at being woken up so early.
f. He told Terry to get lost and hung up.

But here we have a problem again
a–d. (as above)
e. Tony was sick and furious at being woken up so early.
f. He told Terry to get lost and hung up.
g. Of course, he hadn't intended to upset Tony.
…but the reader is misled again in unit (g). The focus had shifted from Terry to Tony in units (e) and (f), and the expectation was that the same character would remain in the center of attention.

Finally, we are done!
a. Terry really goofs sometimes.
b. Yesterday was a beautiful day and he was excited about trying out his new sailboat.
c. He wanted Tony to join him on a sailing expedition.
d. He called him at 6 A.M.
e. Tony was sick and furious at being woken up so early.
f. He told Terry to get lost and hung up.
g. Of course, Terry hadn't intended to upset Tony.
In this version the confusion is finally eliminated.
The conjecture of CT is that the form of expression used in a discourse directly influences the resources required to decipher it. It is known that identifying the referents of noun phrases in a discourse involves a certain inferential process. CT claims that the form in which these noun phrases are expressed can impose a higher or lower inferential load on the reader.

CT: the main lines
• Applies to just one segment of discourse
  – refers to Grosz & Sidner's Attentional State Theory
• Sees the segment as made up of adjacent utterances (sentences)
• Discourse entities are called centers

Centers
John gave Mary a flower.
[Figure: the realisation relation links each expression to a center:]
  John     → person1: type = person, name = John, gender = masc
  Mary     → person2: type = person, name = Mary, gender = fem
  a flower → flower1: type = flower, number = sg

What is a center?
John went to Mary's house.
[Figure: the realisation relation. "Mary's house" realises two centers:]
  person1: type = person, name = Mary, gender = fem
  house1: type = house, number = sg
He met her down the street. (her = person1)
He found it down the street. (it = house1)

For each utterance, compute:
• a list of forward-looking centers: Cf(u_i) = (e1, e2, …, ek)
  ranking: subject > direct object > indirect object > others
• a backward-looking center: Cb(u_i) = the highest-ranked element of Cf(u_{i-1}) that is realised in u_i
• a preferred center: Cp(u_i) = e1

Rule 1: pronoun realisation
• If some element of Cf(u_{i-1}) is realised as a pronoun in u_i, then so is Cb(u_i)
  – it captures the intuition that pronominalisation is one way to indicate discourse salience
  – if there are multiple pronouns in a sentence realising discourse entities from the previous utterance, then the Cb must be one of them
  – if there is just one pronoun, then the pronoun must be the Cb

Rule 1 observed
a. Terry really goofs sometimes.
   Cf = ([Terry])
b. Yesterday was a beautiful day and he was excited about trying out his new sailboat.
   Cf = (he=his=[Terry], [the sailboat])   Cb = [Terry]
c. He wanted Tony to join him on a sailing expedition.
   Cf = (he=[Terry], [Tony], [the expedition])   Cb = [Terry]
d. He called him at 6 A.M.
   Cf = (he=[Terry], him=[Tony])   Cb = [Terry]

Rule 1 still observed
a–c. (as above)
d. He called Tony at 6 A.M.
   Cf = (he=[Terry], [Tony])   Cb = [Terry]

Rule 1 violated
a–c. (as above)
d. Terry called him at 6 A.M.
   Cf = ([Terry], him=[Tony])   Cb = [Terry]
Here the only pronoun realises Tony, while the Cb (Terry) is realised by a name.

Rule 2: transitions
Following (Grosz, Joshi and Weinstein, 1995) and (Brennan, Friedman and Pollard, 1987)

                        Cb(u_i) = Cb(u_{i-1})    Cb(u_i) ≠ Cb(u_{i-1})
  Cb(u_i) = Cp(u_i)     CONTINUING               SMOOTH SHIFT
  Cb(u_i) ≠ Cp(u_i)     RETAINING                ABRUPT SHIFT

Preference: CON > RET > SSH > ASH

Rule 2: a point of discontinuity
a. Terry really goofs sometimes.
   Cf = ([Terry])
b. Yesterday was a beautiful day and he was excited about trying out his new sailboat.
   Cf = (he=his=[Terry], [the sailboat])   Cb = [Terry]
   Transition a → b: CONTINUING
c. He wanted Tony to join him on a sailing expedition.
   Cf = (he=[Terry], [Tony], [the expedition])   Cb = [Terry]
   Transition b → c: CONTINUING
d. He called Tony at 6 A.M.
   Cf = (he=[Terry], [Tony])   Cb = [Terry]
   Transition c → d: CONTINUING
e. Tony was sick and furious at being woken up so early.
   Cf = ([Tony])   Cb = [Tony]
   Transition d → e: SMOOTH SHIFT

Rule 2: further analysis
d. He called Tony at 6 A.M.
   Cf = (he=[Terry], [Tony])   Cb = [Terry]
   Transition c → d: CONTINUING
e. Tony was sick and furious at being woken up so early.
   Cf = ([Tony])   Cb = [Tony]
   Transition d → e: SMOOTH SHIFT
f. He told Terry to get lost and hung up.
   Cf = (he=[Tony], [Terry])   Cb = [Tony]
   Transition e → f: CONTINUING
g. Of course, Terry hadn't intended to upset Tony.
   Cf = ([Terry], [Tony])   Cb = [Tony]
   Transition f → g: RETAINING

Centering hints on pronominal anaphora
from (Grosz, Joshi & Weinstein, 1983)
a. I haven't seen Jeff for several days.
   Cf = (I=[I], [Jeff])   Cb = [I]
b. Carl thinks he's studying for his exams.
   Cf = ([Carl], he=[Jeff], [Jeff's exams])   Cb = [Jeff]
c. I think he(?) went to the Cape with Linda.

Centering explains why we understand he in unit (c) as Jeff
b. Carl thinks he's studying for his exams.
   Cf = ([Carl], he=[Jeff], [Jeff's exams])   Cb = [Jeff]
c. I think he(?) went to the Cape with Linda.
   Reading he = Jeff: Cf = (I=[I], he=[Jeff], [the Cape], [Linda])   Cb = [Jeff]   RETAINING
   Reading he = Carl: Cf = (I=[I], he=[Carl], [the Cape], [Linda])   Cb = [Carl]   ABRUPT SHIFT
RETAINING is preferred to ABRUPT SHIFT, so he is resolved to Jeff.

Attentional State Theory (AST)
(Barbara Grosz & Candace Sidner, 1987)
Models the linguistic structure of the discourse
Gives an account of intentions and how they are combined
Explains the shift of attention during
discourse interpretation
Explains interruptions and flashbacks
Brings out a dynamic domain of referentiality
Three components

AST: 1st component
• a linguistic structure:
  – several sentences are aggregated in the same segment
  – segments display a recursive structure

What does locality mean in the CT view?
• CT accepts the theoretical framework of AST, which explains discourse at the global level. CT was developed to explain what happens inside a segment. This means that it should be able to explain why certain transitions between the units of a segment are easier to process than others. But, as we know from AST, inside a segment we can find dominance relations, which means that the segment contains other subsegments. The recursive definition of the segment therefore raises a question about how to interpret the locality restriction under which CT was defined. Suppose we cross the boundary between the last unit of segment B and the first unit of segment C, where B and C stand in a satisfaction-precedence relation and are both dominated by a larger segment A. On the one hand we are crossing a segment boundary, so CT cannot be applied; on the other hand we remain inside segment A, so CT should be applicable…

Centering: other problems?
• still a local theory (applies inside a segment)
• ranking of Cf elements
  – on what criteria (surface order, syntactic role, functional (Strube & Hahn, 1996)) => language dependent
• null pronouns
  – Italian, Japanese, Romanian
• clitics: doubling references
  – Romanian
• intrasentential centering (Kameyama, 1997)

Centering - a parametric theory
(Poesio et al., 2004)
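
A worked sketch of the centering computations
The definitions of Cb and Cp, Rule 1 and the Rule 2 transition table given earlier in this lecture can be tried out with a few lines of Python. This is only a minimal sketch under stated assumptions, not an implementation taken from the cited papers. The function names (backward_center, transition, analyse, rule1_holds, resolve) are hypothetical. The Cf lists are assumed to be supplied already ranked and with pronouns already mapped to entities. An undefined previous Cb (first utterance) is assumed to count as "unchanged". An utterance that shares no entity with its predecessor is not given a special label.

```python
from typing import Dict, List, Optional, Sequence, Set

# Rule 2 preference order, most to least coherent (CON > RET > SSH > ASH).
PREFERENCE = ["CONTINUING", "RETAINING", "SMOOTH SHIFT", "ABRUPT SHIFT"]


def backward_center(cf_prev: Sequence[str], cf_cur: Sequence[str]) -> Optional[str]:
    """Cb(u_i): highest-ranked element of Cf(u_{i-1}) that is realised in u_i."""
    for entity in cf_prev:  # cf_prev is assumed to be ranked already
        if entity in cf_cur:
            return entity
    return None  # no shared entity: Cb is undefined


def transition(cb_prev: Optional[str], cb_cur: Optional[str],
               cf_cur: Sequence[str]) -> str:
    """Classify the transition into the current utterance (Rule 2 table)."""
    cp_cur = cf_cur[0]  # preferred center = top of the (non-empty) Cf ranking
    # Assumption: an undefined previous Cb counts as "unchanged".
    same_cb = cb_prev is None or cb_cur == cb_prev
    if cb_cur == cp_cur:
        return "CONTINUING" if same_cb else "SMOOTH SHIFT"
    return "RETAINING" if same_cb else "ABRUPT SHIFT"


def analyse(cf_lists: Sequence[Sequence[str]]) -> List[str]:
    """Return one transition label per utterance after the first."""
    labels = []
    cb_prev = None
    for cf_prev, cf_cur in zip(cf_lists, cf_lists[1:]):
        cb_cur = backward_center(cf_prev, cf_cur)
        labels.append(transition(cb_prev, cb_cur, cf_cur))
        cb_prev = cb_cur
    return labels


def rule1_holds(cb_cur: Optional[str], pronominalised: Set[str]) -> bool:
    """Rule 1: if any entity from Cf(u_{i-1}) is a pronoun in u_i, so is Cb(u_i).

    `pronominalised` holds the entities of Cf(u_{i-1}) realised as pronouns in u_i.
    """
    return not pronominalised or cb_cur in pronominalised


def resolve(cb_prev: Optional[str], cf_prev: Sequence[str],
            readings: Dict[str, Sequence[str]]) -> str:
    """Pick the reading whose transition is most preferred under Rule 2."""
    def rank(name: str) -> int:
        cf_cur = readings[name]
        cb_cur = backward_center(cf_prev, cf_cur)
        return PREFERENCE.index(transition(cb_prev, cb_cur, cf_cur))
    return min(readings, key=rank)


# The Terry/Tony discourse, units (a) to (g), with (d) = "He called Tony at 6 A.M."
terry = [["Terry"], ["Terry", "sailboat"], ["Terry", "Tony", "expedition"],
         ["Terry", "Tony"], ["Tony"], ["Tony", "Terry"], ["Terry", "Tony"]]
print(analyse(terry))

# "I think he went to the Cape with Linda": two candidate readings of "he".
readings = {"Jeff": ["I", "Jeff", "Cape", "Linda"],
            "Carl": ["I", "Carl", "Cape", "Linda"]}
print(resolve("Jeff", ["Carl", "Jeff", "Jeff's exams"], readings))
```

On the Terry/Tony discourse the sketch yields CONTINUING for the first three transitions, SMOOTH SHIFT into (e), CONTINUING into (f) and RETAINING into (g), which agrees with the analysis on the Rule 2 slides. For the Jeff/Carl example it selects the reading he = Jeff, because RETAINING ranks above ABRUPT SHIFT.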